Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 97306 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 5.8 MiB |
| Average record size in memory | 62.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Boolean | 4 |
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
| alcohol_use is highly overall correlated with cigarette_use | High correlation |
| cigarette_use is highly overall correlated with alcohol_use | High correlation |
| father_age is highly overall correlated with mother_age | High correlation |
| mother_age is highly overall correlated with father_age | High correlation |
| plurality is highly imbalanced (89.8%) | Imbalance |
| cigarette_use is highly imbalanced (77.6%) | Imbalance |
| alcohol_use is highly imbalanced (77.2%) | Imbalance |
| baby_alive is highly imbalanced (61.2%) | Imbalance |
| ever_born is highly skewed (γ1 = 71.8577652) | Skewed |
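The imbalance percentages in these alerts are consistent with a score of one minus the normalized Shannon entropy of the class counts. The report does not state the formula, so the sketch below is an assumption checked against the class counts listed in the per-variable sections; `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class distribution and k is the number of classes.

    0.0 means perfectly balanced classes; values near 1.0 mean that one
    class dominates. This formula is an assumption, not taken from the report.
    """
    k = len(counts)
    if k < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Class counts copied from the per-variable tables in this report.
scores = {
    "plurality": imbalance_score([95033, 2245, 28]),
    "cigarette_use": imbalance_score([93792, 3514]),
    "alcohol_use": imbalance_score([93716, 3590]),
    "baby_alive": imbalance_score([89921, 7385]),
}

for name, score in scores.items():
    print(f"{name}: {score:.1%}")
```

Under this assumption the four scores round to the same percentages as the Imbalance alerts, which suggests (but does not prove) that the profiler computes something equivalent.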
Reproduction
| Analysis started | 2024-07-30 18:05:09.990214 |
|---|---|
| Analysis finished | 2024-07-30 18:05:38.922386 |
| Duration | 28.93 seconds |
| Software version | ydata-profiling v4.9.0 |
| Download configuration | config.json |
year
Real number (ℝ)
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.374 |
| Minimum | 1969 |
| Maximum | 2008 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 1969 |
|---|---|
| 5-th percentile | 1986 |
| Q1 | 2005 |
| median | 2006 |
| Q3 | 2007 |
| 95-th percentile | 2008 |
| Maximum | 2008 |
| Range | 39 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 7.0675082 |
|---|---|
| Coefficient of variation (CV) | 0.0035260427 |
| Kurtosis | 10.083389 |
| Mean | 2004.374 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -3.2390583 |
| Sum | 1.9503761 × 10⁸ |
| Variance | 49.949672 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 2007 | 23362 | 24.0% |
| 2008 | 23176 | 23.8% |
| 2006 | 20064 | 20.6% |
| 2005 | 19283 | 19.8% |
| 1997 | 397 | 0.4% |
| 2004 | 386 | 0.4% |
| 1982 | 382 | 0.4% |
| 1996 | 372 | 0.4% |
| 1998 | 372 | 0.4% |
| 2000 | 360 | 0.4% |
| Other values (30) | 9152 | 9.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1969 | 309 | 0.3% |
| 1970 | 336 | 0.3% |
| 1971 | 214 | 0.2% |
| 1972 | 199 | 0.2% |
| 1973 | 174 | 0.2% |
| 1974 | 194 | 0.2% |
| 1975 | 195 | 0.2% |
| 1976 | 250 | 0.3% |
| 1977 | 316 | 0.3% |
| 1978 | 299 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 2008 | 23176 | 23.8% |
| 2007 | 23362 | 24.0% |
| 2006 | 20064 | 20.6% |
| 2005 | 19283 | 19.8% |
| 2004 | 386 | 0.4% |
| 2003 | 324 | 0.3% |
| 2002 | 338 | 0.3% |
| 2001 | 341 | 0.4% |
| 2000 | 360 | 0.4% |
| 1999 | 358 | 0.4% |
month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.5483526 |
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4114625 |
|---|---|
| Coefficient of variation (CV) | 0.52096499 |
| Kurtosis | -1.18096 |
| Mean | 6.5483526 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.02487014 |
| Sum | 637194 |
| Variance | 11.638076 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 8735 | 9.0% |
| 7 | 8649 | 8.9% |
| 9 | 8329 | 8.6% |
| 6 | 8167 | 8.4% |
| 5 | 8163 | 8.4% |
| 3 | 8133 | 8.4% |
| 10 | 8088 | 8.3% |
| 12 | 8027 | 8.2% |
| 1 | 7867 | 8.1% |
| 4 | 7847 | 8.1% |
| Other values (2) | 15301 | 15.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 7867 | 8.1% |
| 2 | 7464 | 7.7% |
| 3 | 8133 | 8.4% |
| 4 | 7847 | 8.1% |
| 5 | 8163 | 8.4% |
| 6 | 8167 | 8.4% |
| 7 | 8649 | 8.9% |
| 8 | 8735 | 9.0% |
| 9 | 8329 | 8.6% |
| 10 | 8088 | 8.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 8027 | 8.2% |
| 11 | 7837 | 8.1% |
| 10 | 8088 | 8.3% |
| 9 | 8329 | 8.6% |
| 8 | 8735 | 9.0% |
| 7 | 8649 | 8.9% |
| 6 | 8167 | 8.4% |
| 5 | 8163 | 8.4% |
| 4 | 7847 | 8.1% |
| 3 | 8133 | 8.4% |
is_male
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.2 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| True | 49800 | 51.2% |
| False | 47506 | 48.8% |
weight_pounds
Real number (ℝ)
| Distinct | 832 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.3915582 |
| Minimum | 1.12 |
| Maximum | 16.459999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 1.12 |
|---|---|
| 5-th percentile | 5.5599999 |
| Q1 | 6.6900001 |
| median | 7.3899999 |
| Q3 | 8.1099997 |
| 95-th percentile | 9.1899996 |
| Maximum | 16.459999 |
| Range | 15.339999 |
| Interquartile range (IQR) | 1.4199996 |
Descriptive statistics
| Standard deviation | 1.1006789 |
|---|---|
| Coefficient of variation (CV) | 0.14891027 |
| Kurtosis | 0.77834874 |
| Mean | 7.3915582 |
| Median Absolute Deviation (MAD) | 0.69999981 |
| Skewness | -0.089740433 |
| Sum | 719242.96 |
| Variance | 1.211494 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 7.5 | 1800 | 1.8% |
| 7.369999886 | 1708 | 1.8% |
| 7.559999943 | 1706 | 1.8% |
| 7.25 | 1684 | 1.7% |
| 7.190000057 | 1663 | 1.7% |
| 7.440000057 | 1645 | 1.7% |
| 7.309999943 | 1626 | 1.7% |
| 7 | 1608 | 1.7% |
| 7.690000057 | 1570 | 1.6% |
| 7.630000114 | 1565 | 1.6% |
| Other values (822) | 80731 | 83.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.120000005 | 1 | < 0.1% |
| 1.25 | 1 | < 0.1% |
| 1.289999962 | 1 | < 0.1% |
| 1.5 | 2 | < 0.1% |
| 1.559999943 | 1 | < 0.1% |
| 1.75 | 1 | < 0.1% |
| 1.809999943 | 1 | < 0.1% |
| 1.940000057 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 2.059999943 | 2 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 16.45999908 | 1 | < 0.1% |
| 15.63000011 | 1 | < 0.1% |
| 14.36999989 | 1 | < 0.1% |
| 13.75 | 1 | < 0.1% |
| 13.31000042 | 1 | < 0.1% |
| 12.85999966 | 1 | < 0.1% |
| 12.43999958 | 2 | < 0.1% |
| 12.39000034 | 1 | < 0.1% |
| 12.18999958 | 1 | < 0.1% |
| 12.13000011 | 2 | < 0.1% |
plurality
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 760.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 291918 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.0 | 95033 | 97.7% |
| 2.0 | 2245 | 2.3% |
| 3.0 | 28 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| . | 97306 | 33.3% |
| 0 | 97306 | 33.3% |
| 1 | 95033 | 32.6% |
| 2 | 2245 | 0.8% |
| 3 | 28 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 291918 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| . | 97306 | 33.3% |
| 0 | 97306 | 33.3% |
| 1 | 95033 | 32.6% |
| 2 | 2245 | 0.8% |
| 3 | 28 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 291918 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| . | 97306 | 33.3% |
| 0 | 97306 | 33.3% |
| 1 | 95033 | 32.6% |
| 2 | 2245 | 0.8% |
| 3 | 28 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 291918 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| . | 97306 | 33.3% |
| 0 | 97306 | 33.3% |
| 1 | 95033 | 32.6% |
| 2 | 2245 | 0.8% |
| 3 | 28 | < 0.1% |
apgar_5min
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.9202413 |
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 20 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 760.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 9 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.60029276 |
|---|---|
| Coefficient of variation (CV) | 0.067295574 |
| Kurtosis | 46.005977 |
| Mean | 8.9202413 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -4.8055186 |
| Sum | 867993 |
| Variance | 0.3603514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 9 | 81860 | 84.1% |
| 8 | 7639 | 7.9% |
| 10 | 5804 | 6.0% |
| 7 | 1158 | 1.2% |
| 6 | 382 | 0.4% |
| 5 | 198 | 0.2% |
| 4 | 98 | 0.1% |
| 3 | 67 | 0.1% |
| 1 | 40 | < 0.1% |
| 2 | 40 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 20 | < 0.1% |
| 1 | 40 | < 0.1% |
| 2 | 40 | < 0.1% |
| 3 | 67 | 0.1% |
| 4 | 98 | 0.1% |
| 5 | 198 | 0.2% |
| 6 | 382 | 0.4% |
| 7 | 1158 | 1.2% |
| 8 | 7639 | 7.9% |
| 9 | 81860 | 84.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 5804 | 6.0% |
| 9 | 81860 | 84.1% |
| 8 | 7639 | 7.9% |
| 7 | 1158 | 1.2% |
| 6 | 382 | 0.4% |
| 5 | 198 | 0.2% |
| 4 | 98 | 0.1% |
| 3 | 67 | 0.1% |
| 2 | 40 | < 0.1% |
| 1 | 40 | < 0.1% |
mother_age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 38 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.73424 |
| Minimum | 13 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 23 |
| median | 28 |
| Q3 | 32 |
| 95-th percentile | 38 |
| Maximum | 50 |
| Range | 37 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.9244264 |
|---|---|
| Coefficient of variation (CV) | 0.21361416 |
| Kurtosis | -0.54868687 |
| Mean | 27.73424 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.21533498 |
| Sum | 2698708 |
| Variance | 35.098828 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 26 | 5878 | 6.0% |
| 28 | 5854 | 6.0% |
| 27 | 5832 | 6.0% |
| 29 | 5680 | 5.8% |
| 30 | 5515 | 5.7% |
| 25 | 5483 | 5.6% |
| 24 | 5384 | 5.5% |
| 31 | 5117 | 5.3% |
| 23 | 5025 | 5.2% |
| 22 | 4801 | 4.9% |
| Other values (28) | 42737 | 43.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 4 | < 0.1% |
| 14 | 57 | 0.1% |
| 15 | 234 | 0.2% |
| 16 | 645 | 0.7% |
| 17 | 1290 | 1.3% |
| 18 | 2305 | 2.4% |
| 19 | 3365 | 3.5% |
| 20 | 4002 | 4.1% |
| 21 | 4290 | 4.4% |
| 22 | 4801 | 4.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 50 | 6 | < 0.1% |
| 49 | 3 | < 0.1% |
| 48 | 4 | < 0.1% |
| 47 | 23 | < 0.1% |
| 46 | 33 | < 0.1% |
| 45 | 62 | 0.1% |
| 44 | 114 | 0.1% |
| 43 | 251 | 0.3% |
| 42 | 402 | 0.4% |
| 41 | 612 | 0.6% |
gestation_weeks
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.917543 |
| Minimum | 35 |
| Maximum | 43 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 35 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 38 |
| median | 39 |
| Q3 | 40 |
| 95-th percentile | 41 |
| Maximum | 43 |
| Range | 8 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5921891 |
|---|---|
| Coefficient of variation (CV) | 0.04091186 |
| Kurtosis | 0.17076838 |
| Mean | 38.917543 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.098785192 |
| Sum | 3786910.4 |
| Variance | 2.5350659 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 39 | 26496 | 27.2% |
| 40 | 19692 | 20.2% |
| 38 | 19406 | 19.9% |
| 37 | 9263 | 9.5% |
| 41 | 9117 | 9.4% |
| 36 | 4504 | 4.6% |
| 42 | 3163 | 3.3% |
| 35 | 2544 | 2.6% |
| 43 | 1563 | 1.6% |
| 38.95470428 | 1558 | 1.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 35 | 2544 | 2.6% |
| 36 | 4504 | 4.6% |
| 37 | 9263 | 9.5% |
| 38 | 19406 | 19.9% |
| 38.95470428 | 1558 | 1.6% |
| 39 | 26496 | 27.2% |
| 40 | 19692 | 20.2% |
| 41 | 9117 | 9.4% |
| 42 | 3163 | 3.3% |
| 43 | 1563 | 1.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 43 | 1563 | 1.6% |
| 42 | 3163 | 3.3% |
| 41 | 9117 | 9.4% |
| 40 | 19692 | 20.2% |
| 39 | 26496 | 27.2% |
| 38.95470428 | 1558 | 1.6% |
| 38 | 19406 | 19.9% |
| 37 | 9263 | 9.5% |
| 36 | 4504 | 4.6% |
| 35 | 2544 | 2.6% |
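Nine of the ten distinct gestation_weeks values are whole weeks; the tenth, 38.95470428, occurs 1558 times. One possible reading is that it is a fill value written where the gestation length was missing, but the report does not say so. The sketch below, which uses hypothetical helper names and only the frequency table above, flags non-integer values and compares the suspect value with the mean of the integer values.

```python
# Frequency table for gestation_weeks, copied from the report.
gestation_counts = {
    35: 2544, 36: 4504, 37: 9263, 38: 19406, 38.95470428: 1558,
    39: 26496, 40: 19692, 41: 9117, 42: 3163, 43: 1563,
}


def non_integer_values(freq):
    """Return the entries whose value is not a whole number."""
    return {v: c for v, c in freq.items() if not float(v).is_integer()}


def weighted_mean(freq):
    """Mean of the values, weighted by their counts."""
    total = sum(freq.values())
    return sum(v * c for v, c in freq.items()) / total


suspects = non_integer_values(gestation_counts)
integer_only = {v: c for v, c in gestation_counts.items() if v not in suspects}
mean_of_integers = weighted_mean(integer_only)

print("non-integer values:", suspects)
print(f"mean of integer values: {mean_of_integers:.4f}")
```

The mean of the integer values in this sample is about 38.917, which is close to but not equal to 38.95470428, so if the value is an imputed mean it was presumably computed on a different set of rows. The same check applies to the 39.62236786 value in weight_gain_pounds.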
cigarette_use
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.2 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 93792 | 96.4% |
| True | 3514 | 3.6% |
alcohol_use
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.2 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 93716 | 96.3% |
| True | 3590 | 3.7% |
weight_gain_pounds
Real number (ℝ)
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.834937 |
| Minimum | 1 |
| Maximum | 70 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 23 |
| median | 31 |
| Q3 | 39.622368 |
| 95-th percentile | 53 |
| Maximum | 70 |
| Range | 69 |
| Interquartile range (IQR) | 16.622368 |
Descriptive statistics
| Standard deviation | 12.359923 |
|---|---|
| Coefficient of variation (CV) | 0.38825029 |
| Kurtosis | 0.051579259 |
| Mean | 31.834937 |
| Median Absolute Deviation (MAD) | 8.6223679 |
| Skewness | 0.17735706 |
| Sum | 3097730.4 |
| Variance | 152.76772 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 39.62236786 | 7599 | 7.8% |
| 30 | 7211 | 7.4% |
| 40 | 4813 | 4.9% |
| 25 | 4702 | 4.8% |
| 20 | 4637 | 4.8% |
| 35 | 4303 | 4.4% |
| 32 | 2530 | 2.6% |
| 28 | 2353 | 2.4% |
| 50 | 2277 | 2.3% |
| 33 | 2224 | 2.3% |
| Other values (61) | 54657 | 56.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 165 | 0.2% |
| 2 | 226 | 0.2% |
| 3 | 232 | 0.2% |
| 4 | 272 | 0.3% |
| 5 | 444 | 0.5% |
| 6 | 348 | 0.4% |
| 7 | 469 | 0.5% |
| 8 | 507 | 0.5% |
| 9 | 404 | 0.4% |
| 10 | 1297 | 1.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 70 | 265 | 0.3% |
| 69 | 57 | 0.1% |
| 68 | 99 | 0.1% |
| 67 | 82 | 0.1% |
| 66 | 110 | 0.1% |
| 65 | 306 | 0.3% |
| 64 | 123 | 0.1% |
| 63 | 164 | 0.2% |
| 62 | 151 | 0.2% |
| 61 | 176 | 0.2% |
ever_born
Real number (ℝ)
SKEWED 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0779089 |
| Minimum | 1 |
| Maximum | 231 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 231 |
| Range | 230 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.9043474 |
|---|---|
| Coefficient of variation (CV) | 0.91647302 |
| Kurtosis | 8582.9336 |
| Mean | 2.0779089 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 71.857765 |
| Sum | 202193 |
| Variance | 3.6265392 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 37836 | 38.9% |
| 2 | 32574 | 33.5% |
| 3 | 16472 | 16.9% |
| 4 | 6439 | 6.6% |
| 5 | 2266 | 2.3% |
| 6 | 879 | 0.9% |
| 7 | 407 | 0.4% |
| 8 | 342 | 0.4% |
| 9 | 26 | < 0.1% |
| 10 | 24 | < 0.1% |
| Other values (6) | 41 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 37836 | 38.9% |
| 2 | 32574 | 33.5% |
| 3 | 16472 | 16.9% |
| 4 | 6439 | 6.6% |
| 5 | 2266 | 2.3% |
| 6 | 879 | 0.9% |
| 7 | 407 | 0.4% |
| 8 | 342 | 0.4% |
| 9 | 26 | < 0.1% |
| 10 | 24 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 231 | 4 | < 0.1% |
| 15 | 3 | < 0.1% |
| 14 | 5 | < 0.1% |
| 13 | 4 | < 0.1% |
| 12 | 8 | < 0.1% |
| 11 | 17 | < 0.1% |
| 10 | 24 | < 0.1% |
| 9 | 26 | < 0.1% |
| 8 | 342 | 0.4% |
| 7 | 407 | 0.4% |
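The Skewed alert for ever_born appears to be driven almost entirely by the four rows with the value 231, which sits far from the next largest value of 15. The sketch below recomputes the skewness from the frequency tables above, with and without that value. It uses the population (moment) form of the coefficient; the profiler may use a bias-corrected estimator, which differs only marginally at this sample size. Whether 231 is a sentinel or a data-entry error is not stated in the report, and `skewness` is a hypothetical helper.

```python
# Full frequency table for ever_born, assembled from the common-value
# and extreme-value tables in the report (16 distinct values).
ever_born_counts = {
    1: 37836, 2: 32574, 3: 16472, 4: 6439, 5: 2266, 6: 879, 7: 407,
    8: 342, 9: 26, 10: 24, 11: 17, 12: 8, 13: 4, 14: 5, 15: 3, 231: 4,
}


def skewness(freq):
    """Population skewness m3 / m2**1.5 computed from a value -> count table."""
    n = sum(freq.values())
    mean = sum(v * c for v, c in freq.items()) / n
    m2 = sum(c * (v - mean) ** 2 for v, c in freq.items()) / n
    m3 = sum(c * (v - mean) ** 3 for v, c in freq.items()) / n
    return m3 / m2 ** 1.5


skew_all = skewness(ever_born_counts)

# Drop the suspicious value and recompute.
trimmed = {v: c for v, c in ever_born_counts.items() if v != 231}
skew_trimmed = skewness(trimmed)

print(f"skewness with 231:    {skew_all:.2f}")
print(f"skewness without 231: {skew_trimmed:.2f}")
```

With the full table the result is about 71.86, in line with the reported γ1; without the four rows at 231 it drops below 2, which suggests reviewing those rows before modelling.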
father_age
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 44 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.266664 |
| Minimum | 13 |
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 380.2 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 25 |
| median | 30 |
| Q3 | 35 |
| 95-th percentile | 42 |
| Maximum | 56 |
| Range | 43 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.7621399 |
|---|---|
| Coefficient of variation (CV) | 0.22341874 |
| Kurtosis | 0.075070611 |
| Mean | 30.266664 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.4634618 |
| Sum | 2945128 |
| Variance | 45.726536 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 29 | 5663 | 5.8% |
| 28 | 5549 | 5.7% |
| 30 | 5541 | 5.7% |
| 31 | 5481 | 5.6% |
| 27 | 5319 | 5.5% |
| 26 | 5132 | 5.3% |
| 32 | 5073 | 5.2% |
| 33 | 4754 | 4.9% |
| 25 | 4745 | 4.9% |
| 34 | 4556 | 4.7% |
| Other values (34) | 45493 | 46.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 1 | < 0.1% |
| 14 | 14 | < 0.1% |
| 15 | 53 | 0.1% |
| 16 | 193 | 0.2% |
| 17 | 455 | 0.5% |
| 18 | 1025 | 1.1% |
| 19 | 1692 | 1.7% |
| 20 | 2363 | 2.4% |
| 21 | 2962 | 3.0% |
| 22 | 3577 | 3.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 56 | 45 | < 0.1% |
| 55 | 50 | 0.1% |
| 54 | 65 | 0.1% |
| 53 | 80 | 0.1% |
| 52 | 123 | 0.1% |
| 51 | 125 | 0.1% |
| 50 | 189 | 0.2% |
| 49 | 237 | 0.2% |
| 48 | 311 | 0.3% |
| 47 | 392 | 0.4% |
baby_alive
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 95.2 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| True | 89921 | 92.4% |
| False | 7385 | 7.6% |
| | alcohol_use | apgar_5min | baby_alive | cigarette_use | ever_born | father_age | gestation_weeks | is_male | month | mother_age | plurality | weight_gain_pounds | weight_pounds | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| alcohol_use | 1.000 | 0.022 | 0.032 | 0.989 | 0.000 | 0.055 | 0.021 | 0.000 | 0.009 | 0.072 | 0.007 | 0.042 | 0.067 | 0.056 |
| apgar_5min | 0.022 | 1.000 | 0.031 | 0.022 | 0.046 | -0.000 | 0.038 | 0.015 | -0.003 | -0.010 | 0.016 | -0.002 | 0.031 | -0.083 |
| baby_alive | 0.032 | 0.031 | 1.000 | 0.026 | 0.019 | 0.057 | 0.009 | 0.000 | 0.006 | 0.068 | 0.000 | 0.051 | 0.015 | 0.202 |
| cigarette_use | 0.989 | 0.022 | 0.026 | 1.000 | 0.000 | 0.056 | 0.021 | 0.000 | 0.008 | 0.074 | 0.006 | 0.043 | 0.070 | 0.065 |
| ever_born | 0.000 | 0.046 | 0.019 | 0.000 | 1.000 | 0.288 | -0.101 | 0.000 | 0.000 | 0.335 | 0.000 | -0.134 | 0.055 | -0.013 |
| father_age | 0.055 | -0.000 | 0.057 | 0.056 | 0.288 | 1.000 | -0.049 | 0.000 | 0.003 | 0.781 | 0.037 | -0.068 | 0.065 | 0.050 |
| gestation_weeks | 0.021 | 0.038 | 0.009 | 0.021 | -0.101 | -0.049 | 1.000 | 0.030 | -0.000 | -0.064 | 0.167 | 0.046 | 0.315 | -0.023 |
| is_male | 0.000 | 0.015 | 0.000 | 0.000 | 0.000 | 0.000 | 0.030 | 1.000 | 0.000 | 0.007 | 0.005 | 0.028 | 0.117 | 0.000 |
| month | 0.009 | -0.003 | 0.006 | 0.008 | 0.000 | 0.003 | -0.000 | 0.000 | 1.000 | 0.003 | 0.000 | -0.020 | -0.009 | -0.002 |
| mother_age | 0.072 | -0.010 | 0.068 | 0.074 | 0.335 | 0.781 | -0.064 | 0.007 | 0.003 | 1.000 | 0.050 | -0.076 | 0.083 | 0.059 |
| plurality | 0.007 | 0.016 | 0.000 | 0.006 | 0.000 | 0.037 | 0.167 | 0.005 | 0.000 | 0.050 | 1.000 | 0.070 | 0.217 | 0.011 |
| weight_gain_pounds | 0.042 | -0.002 | 0.051 | 0.043 | -0.134 | -0.068 | 0.046 | 0.028 | -0.020 | -0.076 | 0.070 | 1.000 | 0.142 | -0.087 |
| weight_pounds | 0.067 | 0.031 | 0.015 | 0.070 | 0.055 | 0.065 | 0.315 | 0.117 | -0.009 | 0.083 | 0.217 | 0.142 | 1.000 | -0.020 |
| year | 0.056 | -0.083 | 0.202 | 0.065 | -0.013 | 0.050 | -0.023 | 0.000 | -0.002 | 0.059 | 0.011 | -0.087 | -0.020 | 1.000 |
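The two High correlation alerts can be traced back to this matrix by scanning the off-diagonal entries for values above a cutoff. The sketch below works on a hand-copied excerpt of the matrix and assumes a cutoff of 0.5; the report does not state the threshold it used, and `strong_pairs` is a hypothetical helper.

```python
# Excerpt of the correlation matrix: the largest off-diagonal entries,
# copied from the table above. Each pair is listed once.
correlations = {
    ("alcohol_use", "cigarette_use"): 0.989,
    ("father_age", "mother_age"): 0.781,
    ("ever_born", "mother_age"): 0.335,
    ("gestation_weeks", "weight_pounds"): 0.315,
    ("ever_born", "father_age"): 0.288,
    ("plurality", "weight_pounds"): 0.217,
    ("baby_alive", "year"): 0.202,
}


def strong_pairs(corr, threshold=0.5):
    """Return the pairs whose absolute correlation is at or above the threshold,
    strongest first. The 0.5 default is an assumption, not a documented setting."""
    hits = [(pair, value) for pair, value in corr.items() if abs(value) >= threshold]
    return sorted(hits, key=lambda item: abs(item[1]), reverse=True)


flagged = strong_pairs(correlations)
for (left, right), value in flagged:
    print(f"{left} ~ {right}: {value:.3f}")
```

With this cutoff only the alcohol_use/cigarette_use and father_age/mother_age pairs are flagged, matching the alerts; every other entry in the matrix is below 0.34.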
| | year | month | is_male | weight_pounds | plurality | apgar_5min | mother_age | gestation_weeks | cigarette_use | alcohol_use | weight_gain_pounds | ever_born | father_age | baby_alive |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2005 | 2 | False | 5.37 | 2.0 | 8.0 | 29 | 38.0 | false | false | 8.0 | 10.0 | 31 | True |
| 1 | 2005 | 10 | True | 6.88 | 1.0 | 9.0 | 30 | 40.0 | false | false | 25.0 | 1.0 | 50 | False |
| 2 | 2005 | 10 | True | 6.76 | 1.0 | 9.0 | 19 | 38.0 | false | false | 25.0 | 1.0 | 24 | True |
| 3 | 2005 | 10 | False | 8.69 | 1.0 | 9.0 | 27 | 39.0 | false | false | 47.0 | 1.0 | 30 | True |
| 4 | 2005 | 9 | False | 7.00 | 1.0 | 9.0 | 20 | 40.0 | false | false | 42.0 | 1.0 | 30 | True |
| 5 | 2005 | 9 | False | 8.06 | 1.0 | 9.0 | 35 | 40.0 | false | false | 12.0 | 1.0 | 41 | True |
| 6 | 2005 | 2 | True | 5.18 | 1.0 | 8.0 | 24 | 36.0 | false | false | 9.0 | 1.0 | 24 | True |
| 7 | 2005 | 5 | False | 7.55 | 1.0 | 9.0 | 32 | 40.0 | false | false | 30.0 | 1.0 | 42 | True |
| 8 | 2005 | 1 | False | 5.89 | 1.0 | 9.0 | 30 | 37.0 | false | false | 31.0 | 1.0 | 35 | False |
| 9 | 2005 | 11 | True | 5.00 | 1.0 | 9.0 | 18 | 37.0 | false | false | 63.0 | 1.0 | 24 | False |
| | year | month | is_male | weight_pounds | plurality | apgar_5min | mother_age | gestation_weeks | cigarette_use | alcohol_use | weight_gain_pounds | ever_born | father_age | baby_alive |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 97296 | 1970 | 2 | True | 8.38 | 1.0 | 9.0 | 27 | 38.954704 | false | false | 39.622368 | 6.0 | 22 | False |
| 97297 | 1970 | 3 | False | 7.19 | 1.0 | 9.0 | 28 | 38.954704 | false | false | 39.622368 | 5.0 | 31 | False |
| 97298 | 1970 | 9 | False | 6.94 | 1.0 | 9.0 | 23 | 38.954704 | false | false | 39.622368 | 4.0 | 33 | True |
| 97299 | 1970 | 6 | True | 7.50 | 1.0 | 9.0 | 42 | 38.954704 | false | false | 39.622368 | 4.0 | 47 | True |
| 97300 | 1970 | 7 | True | 6.69 | 1.0 | 9.0 | 28 | 38.954704 | false | false | 39.622368 | 7.0 | 54 | True |
| 97301 | 1971 | 9 | True | 8.62 | 1.0 | 9.0 | 20 | 38.954704 | false | false | 39.622368 | 1.0 | 21 | True |
| 97302 | 1971 | 11 | False | 6.38 | 1.0 | 9.0 | 19 | 38.954704 | false | false | 39.622368 | 1.0 | 21 | True |
| 97303 | 1971 | 12 | False | 7.25 | 1.0 | 9.0 | 19 | 38.954704 | false | false | 39.622368 | 1.0 | 22 | True |
| 97304 | 1971 | 6 | False | 6.62 | 1.0 | 9.0 | 20 | 38.954704 | false | false | 39.622368 | 1.0 | 26 | True |
| 97305 | 1971 | 5 | True | 6.44 | 1.0 | 9.0 | 18 | 38.954704 | false | false | 39.622368 | 1.0 | 20 | True |
Most frequently occurring
| | year | month | is_male | weight_pounds | plurality | apgar_5min | mother_age | gestation_weeks | cigarette_use | alcohol_use | weight_gain_pounds | ever_born | father_age | baby_alive | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2006 | 9 | False | 7.69 | 1.0 | 9.0 | 28 | 39.0 | false | false | 30.0 | 2.0 | 29 | True | 2 |